Empirical likelihood inference for regression parameters when modelling hierarchical complex survey data
نویسندگان
چکیده
The data used in social, behavioural, health or biological sciences may have a hierarchical structure due to the natural structure in the population of interest or due to the sampling design. Multilevel or marginal models are often used to analyse such hierarchical data. The data may include sample units selected with unequal probabilities from a clustered and stratified population. Inferences for the regression coefficients may be invalid when the sampling design is informative. We apply a profile empirical likelihood approach to the regression parameters, which are defined as the solutions of a generalised estimating equation. The effect of the sampling design is taken into account. This approach can be used for point estimation, hypothesis testing and confidence intervals for the subvector of parameters. It asymptotically provides valid inference for the finite population parameters under a set of regularity conditions. We consider a two–stage sampling design, where the first stage units may be selected with unequal probabilities. We assume that the model and sampling hierarchies are the same. We treat the first stage sampling units as the unit of interest, by using an ultimate cluster approach. The estimating functions are defined at the ultimate cluster level of the hierarchy.
منابع مشابه
Empirical Likelihood for Estimating Equations with Nonignorably Missing Data.
We develop an empirical likelihood (EL) inference on parameters in generalized estimating equations with nonignorably missing response data. We consider an exponential tilting model for the nonignorably missing mechanism, and propose modified estimating equations by imputing missing data through a kernel regression method. We establish some asymptotic properties of the EL estimators of the unkn...
متن کاملBayesian Inference for Spatial Beta Generalized Linear Mixed Models
In some applications, the response variable assumes values in the unit interval. The standard linear regression model is not appropriate for modelling this type of data because the normality assumption is not met. Alternatively, the beta regression model has been introduced to analyze such observations. A beta distribution represents a flexible density family on (0, 1) interval that covers symm...
متن کاملModelling complex survey data with population level information: an empirical likelihood approach.
Survey data are often collected with unequal probabilities from a stratified population. In many modelling situations, the parameter of interest is a subset of a set of parameters, with the others treated as nuisance parameters. We show that in this situation the empirical likelihood ratio statistic follows a chi-squared distribution asymptotically, under stratified single and multi-stage unequ...
متن کاملA Bayesian Nominal Regression Model with Random Effects for Analysing Tehran Labor Force Survey Data
Large survey data are often accompanied by sampling weights that reflect the inequality probabilities for selecting samples in complex sampling. Sampling weights act as an expansion factor that, by scaling the subjects, turns the sample into a representative of the community. The quasi-maximum likelihood method is one of the approaches for considering sampling weights in the frequentist framewo...
متن کاملLikelihood Based Inference for Quantile Regression Using the Asymmetric Laplace Distribution
To make inferences about the shape of a population distribution, the widely popular mean regression model, for example, is inadequate if the distribution is not approximately Gaussian (or symmetric). Compared to conventional mean regression (MR), quantile regression (QR) can characterize the entire conditional distribution of the outcome variable, and is more robust to outliers and misspecifica...
متن کامل